Sync WebGPU bridge to llama.cpp b9016 by leehack · Pull Request #6 · leehack/llama-web-bridge

leehack · 2026-05-08T18:32:26Z

Summary

update bridge CI and publish defaults from the old llama.cpp pin to b9016
include the current MTMD image/source layout required by newer llama.cpp builds
make CI build and upload the same base + mem64 artifact set that publish emits
forward native-compatible load options through the JS/C++ bridge ABI (n_seq_max, flash attention, KV cache type, KV-unified, RoPE, split mode, main GPU)

built locally against llama.cpp b9016 with WEBGPU_BRIDGE_BUILD_MEM64=1
node --check js/llama_webgpu_bridge.js
git diff --check
source tag v0.1.12 published successfully
published leehack/llama-web-bridge-assets@v0.1.12 with manifest llama_cpp_tag: b9016 and source commit 4048425c9268e7e9aa330364179bcc567d7d306d

leehack added 3 commits May 8, 2026 14:15

chore: sync bridge to llama.cpp b9016

4a1abc4

Add web bridge native load option parity

4048425

Merge remote-tracking branch 'origin/main' into sync-webbridge-b9016

d28fbd7

leehack merged commit ab2a6d7 into main May 8, 2026
1 check passed